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NON PARAMETRIC ESTIMATES OF OPTION PRICES USING 

SUPERHEDGING 


GIANLUCA CASSESE 

Abstract. We propose a new non parametric technique to estimate the CALL function based 
on the superhedging principle. Our approach does not require absence of arbitrage and easily 
accommodates bid/ask spreads and other market imperfections. We prove some optimal statistical 
properties of our estimates. As an application we first test the methodology on a simulated sample 
of option prices and then on the S&P 500 index options. 


1. Introduction 

A classical exercise in the econometric analysis of financial markets is to estimate option prices 
and the risk neutral probability (or its density) which is implicit in them, as suggested by the 
famous works of Breeden and Litzenberger [5] and of Banz and Miller [3]. Of special importance in 
this exercise is the use of non parametric techniques which have become popular in the last decades. 
Regardless of the econometric approach taken, a first step of crucial importance is purging the data 
from observations violating some no-arbitrage condition and thus conflicting with the conclusion 
that prices are the expected value of the asset discounted payoff computed with respect to the risk 
neutral probability, as assumed by Breeden and Litzenberger. 

Recalcitrant observations may however originate from different sources other than just pure mis¬ 
pricing. There are first some microstructural issues, such as the bid/ask spread or other transaction 
costs, which are not considered in the risk neutral approach. The common practice is to assume 
that these components are uncorrelated with the fundamental value of the assets and may therefore 
be disposed of with simple transformations, such as computing mid prices. Nevertheless, a more 
accurate investigation reveals that market makers often adjust the spread in response to pressure 
originating either from demand or supply, a fact that contributes to make the spread asymmetric 
and to induce a possibly significant correlation with the fundamental price. A second factor influ¬ 
encing market data is the existence of restrictions to trading, these too often overlooked in asset 
pricing models which typically assume the space of marketed claims to be linear. The extreme 
versions of these restrictions take the form of short selling prohibitions but even with less drastic 
market rules, the possibility of shorting assets is clearly delicate and subject to constraints such 
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as the provision of appropriate margins the effect of which receives in general little attention - if 
any - in theoretical and applied work on options. In fact, violations of the lower bound for CALL 
options, a mispricing that may be exploited by shorting the underlying, are often more frequent 
than others associated with strategies involving positions which are easier to take. Some additional 
noise arises with strategies which prescribe to invest simultaneously on different markets, a charac¬ 
teristic which results not only in relevant fixed costs but also in the lack of trade synchronism thus 
inducing additional risks in the execution of arbitrage strategies. A further issue arises eventually 
in connection with the riskless asset appearing in virtually all financial models and definitely in 
all empirical exercises. The assumption of a riskless asset and its explicit identification in applied 
work appears more and more counter factual as the interbank market turmoil in the years 2007-08 
and the sovereign debt crisis following it have clearly demonstrated. Not only during periods of 
crisis it cannot be assumed that bonds are riskless but it is also hard to assert that the implicit 
risks are uncorrelated with the equity market. 

Of course, the relevance of the preceding remarks would be much less disturbing if one could 
restrict attention to just the options market and consider only trading strategies which involve long 
positions. Unfortunately in this restricted perspective the fundamental premise of the Breeden 
Litzenberger exercise, i.e. that option prices are set on the basis of a risk neutral probability, 
cannot be assumed. Nevertheless, even if real option prices may fail to possess some fundamental 
properties, the superhedging price, conveniently computed will possibly satisfy them under mild 
conditions which involve only very simple and realistic trading strategies. The first step of our 
econometric approach consists of replacing original option prices with superhedging prices. 

As a matter of fact this idea is not new. Ait-Sahalia and Duarte [T] have inaugurated a two stage 
approach to non parametric estimation of option prices under shape restrictions which, in the first 
stage, prescribes to project market prices following a methodology adapted from Dykstra [8] . This 
procedure enforces the shape restrictions required. In the second stage, the prices so transformed 
are smoothed out using a local polynomial technique, a methodology which generalizes suitably 
the kernel by replacing constant functions with polynomials of arbitrary but preassigned degree. It 
turns out, as will be shown in the body of the paper, that the projection technique suggested by 
Dykstra produces just the options superhedging price, q^{k) - with k the strike price. The need for 
additional smoothing arises because the CALL function obtained, k —>• q^{k) is piecewise linear and 
therefore not informative enough for many a purpose, including the project to extract a probability 
density. 

In a recent paper, [6j, we have obtained a result which is particularly useful for the present 
purposes and that lays the ground for our non parametric procedure. It states, roughly put, that 
it is possible to compute explicitly the super hedging price q^{t) for a large class G of derivatives 
written on the same underlying and with payoff depending in a convex and decreasing way on 
a positive parameter, t. Not only, but for each such family G of derivatives it is possible to 
extract from the prices q^{t) a probability measure, implicit in them in much the same way 
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as suggested by Breeden and Litzenberger. A special case is of course the family corresponding 
to plain options but in general the choice of the family G is open. Our suggestion is to consider 
new derivatives with a payoff which, while being perfectly smooth, approximates the option payoff 
uniformly. Moreover, the degree of approximation should be made sample dependent so to obtain 
good asymptotic properties. As long as these properties are guaranteed, any candidate G is suitable, 
more or less as the functional form of kernels is relatively unimportant in comparison with the choice 
of the bandwidth. We have found it easy to work with splines but it should made clear from the 
outset that our use of splines has nothing to do with the econometric approaches based on this class 
of functions, such as those reviewed by Eubank [9] and successfully applied to options by Fengler and 
Hin [H]- Our problem in fact is not that of smoothing the option prices (or their implied volatility 
surface) but rather the option payoff. Splines simply turn out to be a conveniently tractable tool 
from a computational point of view; in addition, the smoothness/goodness of ht trade off may be 
tuned conveniently via a parameter acting as the bandwidth in kernel estimation. Other smoothing 
techniques may be employed, e.g. those based on some known probability density function. As 
a term of comparison, we shall briefly discuss the result obtained by smoothing options via the 
normal density rather than splines. 

The econometric analysis of option prices has grown over the years to become almost a field of its 
own and has been masterly reviewed by Garcia, Ghysels and Renault in m- The interested reader 
may find in their work an exhaustive list of references that we shall not try to improve upon here, for 
reasons of brevity. The non parametric approach has itself produced quite a number of important 
contributions that are worth discussing briefly with no pretension of completenes^ A forerunner 
of this stream of studies is the paper by Jackwerth and Rubinstein m in which the parameters of a 
binomial tree are set so as to minimize the distance between binomial and actual option prices given 
a penalty for deviations from an initial a priori distribution. The assumption of absence of arbitrage 
opportunities is absolutely crucial here and is embodied in the tree parameters. Estimating the 
option price produces simultaneously an estimate of the risk neutral measure. Ait-Sahalia and Lo 
[2] estimate the CALL function using the Nadaraya-Watson kernel and recover the risk-neutral 
density by computing the second derivative. This methodology has optimal asymptotic properties 
but relatively poor performance in small samples, a fact common to many non parametric methods 
and of special concern for options market. The curse of dimensionality problem, see |251 p. 675], 
requires to limit as much as possible the number of state variables. For example, the risk neutral 
density estimated in [2] via kernel smoothing is surely convergent to the true density but may fail 
even to be non negative in small samples [21 footnote 11, p. 508]. To circumvent this problem the 
authors propose a semi nonparametric approach in which the price is computed according to the 
Black and Scholes formula in which the volatility function is estimated non parametrically. This 
choice has the clear advantage of guaranteeing the correct shape of the CALL function. Shape 
restrictions are easily accommodated in parametric modeling but are much more troublesome in 

^ A nice review of non parametric methods and their relevance for economics is in Yatchew |25| . 
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the non parametric approach. Papers implementing the nonparametric methodology with shape 
restrictions are less numerous and more recent. Further to Ait-Sahalia and Duarte [T], another 
example of non parametric techniques incorporating shape restrictions is the paper by Yatchew and 
Hardle [26] who follow a least squares approach in Sobolev spaces. There exist eventually several 
papers who adopt spline techniques to estimate option prices and one should mention Fengler m, 
Fengler and Hin m and Yin, Wang and Qi m- 

We think that if our approach has any merit compared to the above references, then this lies 
in its clear financial interpretation. In fact we obtain the smoothness of the CALL function by 
pricing an appropriate derivative rather than performing some local averaging or implementing 
some other statistical techniqu^ Moreover, our method allows to take selection effects into full 
account, a fact not always clearly considered. Eventually the estimates produced are extremely 
tractable computationally speaking and have desirable convergence properties. 

The paper is structured as follows. In section we introduce the fundamental results obtained 
in [6] and needed in our exercise, together with the necessary notation. In section]^ we illustrate 
all details of our spline based technique and prove some of its properties. We also comment on the 
possibility of using given density functions. In section we perform some numerical experiments 
on option prices generated by an a priori model. In particular we construct non parametric interval 
estimates in the presence of a realistic structure of noise. Eventually in section we apply the 
proposed methodology to a sample of S&P option prices. All proofs are in the Appendix. 

Let us mention in closing that, although as dictated by the literature on this topic we start 
assuming the existence of a given probability space, (D,=^,P), the results developed in |6| do not 
require this classical premise. 


2. Option Pricing 

Most of this section is adapted from [6j. Let the random variable X represent hereafter the 
payoff of a given underlying and K{X) the set of strike prices (including A: = 0) of all CALL 
options written on it. We assume P{X > 0) = 1 and P{X < k) > 0 for all k G K{X) \ {0}|^ 
The portfolio consisting of one option with strike price k G K{X) and its price will be indicated 
by the symbols 6{k) and q{k) respectively. Moreover, its payoff will be denoted with the symbol 
g^{X) = {X — k)^. As we only consider long positions, q{k) will actually be the ask price and 
we shall assume that it is positive. The set 0 of admissible trading strategies is a convex set of 
portfolios formed by taking long positions in the set of traded options. The price and the payoff of 
9 = an9{kn) G 0 - so that ai,..., aj\f > 0 and fci,..., /cat G K{X) - are denoted respectively 


^ A partial exception is the XMM methodology proposed by Gagliardini et al. In which the GMM is applied 
with the additional constraint of reproducing a subset of given prices which are considered a priori to be correct. 

^ This corresponds to assuming that no quoted option is known to expire in the money with probability 1. 
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N N 

( 1 ) q{6) = o^nQjkn) and dei^) — ^OnSKX) 

n=l n=l 

The price function q does not include possible fixed trading costs. The apparent linear structure 
implicit in Q, notwithstanding the existence of bid/ask spreads (which many authors associate 
with subadditivity of prices, see El), is due to the restriction included in 0 that only long positions 
may be assumed and the traditional anonymity of option trading. OTC trading of options follows, 
as is well known, different pricing schemes, often ad hoc, and our theory does not apply to these 
transaction^ 

We define the superhedging price of an arbitrary random quantity /i ; 0 —)> M a^ 


(2) TT{h) =inf |Ag(0) : P{Xg^{X)/{X Al) > /i) = 1, A > 0, 0 G o} 

Write q^{k) = TT(^g^{X)/{X A 1)). We say that an option with strike price k is priced efficiently 
if q{k) = q^{k) and denote by K^{X) = {jo,..., j/} the set of strike prices for which we have an 
efficient price. It is easily shown that necessarily jo = 0. Remark that the notion of efficiency 
adopted here is particularly poor as it only involves investments with long positions in CALL 
options and the underlying. Neither PUT options. Futures nor bonds are contemplated. This is 
desirable since the larger the set of derivatives involved the more likely is it that efficiency may fail, 
as in the case of the PUT/CALL parity. 

We make the following 

Assumption 1. Let X = ess sup A and assume that 

(3) X > ji and esssup(A A z) = X A z for each z G M 


Denote by T the set of convex functions g : M+ —>■ M+ with g{fS) = 0 and limn^oo g{n)ln < oo. 
Clearly, the function g^ introduced above and corresponding to the option payoff is an element of 
T for all t > 0: write = { 5 '^ ; t > 0}. Of course, it is possible to write other derivatives on X 
possessing some of the properties of options. In the Appendix we prove the following: 


Theorem 1. For each / G T there exists 0{f) G 0 such that 7r(/(A)/(A A 1)) = q{0{f)) and 


( 4 ) 


4 

5 



(ji - jo) 

0 

0 

-1 

/(ji) 

d{o{f)) = te(jo),---,g(i/)] 

in - jo) 

ih-h) ■ 

0 


fUi) 


do{X) 

di{X) . 

. di{X) _ 


f{X) 


More details on the assumptions behind the present construction are found in [5]. 

In the functional tt is defined in much greater generality and no probability measure P is taken as exogenously 


given. 
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where di{X) = {X — ji) and f{X) = f{X) if X < oo or else di{X) = 1 and f{X) = lim„ f{n)/n. 
In particular, we can write 


( 5 ) 

Let Di{h) 


I 


I 


iW)) = E ^ ^ ^if (ji) +^/+l/(-^) 

2=0 2=1 


Jliiil when i = 0,1 and = Dj{h) = 0. Then, if X = oo we have 


(6a) wi = f{X) - -D/-i(/), bi+i = q{ji) 

( 6 b) Wi = Di{f) - Di_i{f) and 5^+1 = A+i(g) - A(g) i = 0,1 

The next result is just a restatement of [H Theorem 7]. 

Theorem 2. Let G = {gt : t G M+j be a collection of functions G T such that 

(7) ags + (1 — a)gu > gt whenever a G [0,1], s,t,u £ M+ and as + (1 — a)u < t 

Write q^{t) = TT(^gt{X)/{X A 1)). There exists (5^ > 0 and G ca(^(M+)) such that 

/ OO 

u^{x > z)dz for all t>0 


Observe that, by standard rules, 


(9) 


> z)dz 



t)^dn^{x) 


Jt J 

Thus Q, represents the price of the G derivatives as the sum of a bubble part and of their 
fundamental value. This view is substantiated by the observation that necessarily 


( 10 ) 


13^ = lim q^{t) 

t^OQ 


so that, upon choosing G = G^, the term represents the option price as the strike approaches 
infinity and contributes to explaining the overpricing of deeply out of the money CALL’s often 
documented empirically in some form of the smile effect. 

As remarked in [ 6 ], Breeden and Litzenberger formula applies, giving: 


( 11 ) 


v^{x > t) 


dq^{k) 


dk 


k=t 


for all t > 0 


while in the model of Black and Scholes one has, as is well known, 


( 12 ) 


n^^{x >k) = e-^'^<I>{d 2 ) with ds 


ln(S'o/A:) + (r - 1(t‘^)T 

aVr 


so that = exp(—rT). Observe that in our setting, we do not have an a priori restriction on 

the norm of because our market does not include any riskless bond. 
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3. Estimating the CALL function 


For what concerns the choice of G, it is natural to start considering the family G^. By construc¬ 
tion q^{k) represents the efficient price of the corresponding option and so it coincides with q{k) if 
and only if k £ K^{X). More generally one sees that the function q^ is the highest among the pos¬ 
itive, convex and decreasing curves passing through the knots {{j,q{j)) ■ j £ K^{X)}. Ai't-Sahalia 
and Duarte [T] obtain the same values in the first step of their approach. Since we have to make a 
choice for definiteness let’s assume henceforth 


Assumption 2. A = oo. 

Replace g with in Theorem Then, according to 
corresponding option is to invest 

ji+i - k 


), the cheapest way to superhedge the 


Wi{k) = 


and w^j^i{k) = 1 — w^{k) 


3i+l Ji 

in the options with strike prices ji and ji+i respectively, where jj+i > k > ji, and nothing in all 
other options; or else, if /c > jj, to buy one unit of the option with strike price jj. The corresponding 
superhedging price would be 


(13) 


f ^ q(ji) q(ji^i) if ji < k < ji^i 

1 qUi) if ji < k 


Observe that the CALL function k —)■ q^{k) satisfies: (i) q^{ji) = q{ji) for i = 0,..., / and {ii) it is 
a straight line on each interval [jj, jj+i] for i = 0,1 and on [ji, oo). Then it coincides with 
the projection obtained by following Dykstra’s method, see [U sec. 4.1r' 


(14) 


From (13) we deduce easily that 

i/°(x >k) = 


g(L)-g(L+i) 

ji + 1 ji 


if ji < k < jj+i 


^0 if jj < k 

In other words, the implied set function coincides - not surprisingly - with the derivative of the 
price function over the discrete set K^{X) upon a change of sign. 


Despite being a theoretically exact formula, the informational content of (14) is indeed quite 


poor empirically, as a consequence of the limited number of options which are priced efficiently by 
the market - the size of K^{X). In fact u°(x > t) remains constant within two adjacent strike 
prices in K^{X) or, equivalently, the efficient CALL function q^{k) is piecewise linear. Adding the 
fictitious efficient prices q^{k) to the original data set, thus, does not improve our knowledge of u. 

Another way of putting it is saying that superhedging of options is an intrinsically trivial exercise, 
a conclusion to which contribute two distinct factors. First, the piecewise linear nature of the payoff 
function is such that superhedging never requires more than two efficient options which makes the 


° In fact, Ai't-Shalia and Duarte have to adapt slightly the method of Dykstra since, assuming the lower bound 
for CALL options holds, the set unto which they project is not a convex cone but just a convex set. However, this 
additional restriction, for the reasons outlined above, does not apply here. See [T] p. 18 and Appendix A]. 
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corresponding price function locally linear, as in (13). On the other hand, the relative scarcity of 


available efficient prices makes the length of intervals on which the function is linear wider. If the 
size of the set K^{X) is a constraint in this problem and essentially depends on the market structure, 
a quick look at (Q reveals that smoothness of depends on the smoothness of the underlying 
family G. Given that obtaining smooth estimates is the goal of any econometric exercises, our 
first step is then to construct a family of derivatives written on X, each having a payoff - 
denoted by g^{X) - which is (i) conveniently close to the corresponding CALL option but (ii) 
twice continuously differentiable and such that {Hi) G^ satisfies 0 . These properties guarantee 
that the corresponding price, q^{k), will be a smooth estimate of the option prices and that it will 
admit the representation ([^ in terms of an implicit pricing measure. The class G^ of derivatives 
will depend on a control parameter, h > 0, which acts in much the same way as the bandwidth 
parameter in kernel regression. We give now a more detailed description. 

3.1. The Spline Approach. Let k,h > 0 and N € N he given. Divide the interval [—h,h] into 
N intervals of equal length, with endpoints = —h + 2ih/N and i = 0,...,N. Consider then 
the following functional (with D^f denoting the second derivative of / and the CALL payoff 
function with strike k as above): 

N 2 pk-\-h 

(15) Ik{h]g]X) = Y^\^gl{k + t'i) - g{k + t'i)^ ' ' . xn2 

and the program 


i=0 


+ A / \D‘^g{x)'\ dx 

Jk-h 


(16) 


Ik{h;\)= min 4(/i; 5 ; A) 
gern "^2 


It is well known, see m Theorem 5.2], that a solution to this problem is given by a cubic spline 
which is linear outside of [k — h,k + h]. Based on the fact that the second derivative of a cubic spline 


is locally linear, the infinite dimensional problem (16) conveniently reduces to a 2iV-dimensional 
one. Turlach [2l] developed a methodology to compute its solution under several shape restrictions 
such as (i) g, Dg, D^g > 0, i.e. positivity, monotonicity and convexity. To these constraints we add 
the following: {ii) g{x) = Dg{x) = 0 for x < A: — /i, {Hi) g{k + h) = g^{k + h) and (iv) Dg{x) = 1 
for X > k + h. Denote 


(17) 


x{k; /i) = {<7 G ‘^^(M+) : g cubic spline meeting the constraints (i) — (iu)} 


The notation adopted is consistent with the choice of treating A as a fixed parameter and of 
focusing exclusively on properties which depend on h. Although we experimented several values, in 
what follows we will set N = 10. More importantly, we make the choice of A endogenous by letting 
\h = (O.l/i)^ - so that Aio = 1. The existence of a constrained solution to (16) and the properties 
of such solution are proved in the next: 


Lemma 1. The problem 
(18) 


Ik{h) = min { 4 (/i; g] Xh) ■ g ^ x{k] h) ] 
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admits one and only one solution, G x{k', h), and this satisfies: (i) = g^{x — m) for all 

k,m,x > 0, (a) g^ < g^ < g^ whenever h' < h and (in) sup^, [g^ — g^) (x) = 0. 

Thus if we set 

( 19 ) G^ = {gt.k>0} CT 

the family satisfies 0 . Actually, in the proof of Lemma we obtain that for fixed feo, /iq > 0 

(20) g^ix) = g’^° (^ko + (x - k)^^ ^ for every k, h > 0 
so that property {in) of Lemmafollows easily from 


( 21 ) 


0 < q^{k) - q^{k) < tt 


( gl{X) - gl{X) \ 

V ^A1 


< 


g'iik) 


{k — h) /\1 


9(0) < 


ffr(fc)g(o) h_ 

{k — ho) A I ho 


Denote by G 0 the portfolio 0{g^) involved in super replicating g^ and by g{( its payoff. 

The content of Lemma is clearly illustrated in Figure Panels A and B, where the function 
g^ and its derivative are plotted for different values of h. 

[Figure 1 about here.] 

In Panel C the target payoff g^ is plotted together with the payoff g^ of the portfolio that super 
replicates it in order to understand the role of efficiency. By construction, superhedging g^ or g^ 
are entirely equivalent exercises. Thus, the degree of smoothness involved depends not only on the 
shape of the notional payoff g^ but also on that of its market counterpart, g^, and particularly on 
the number of contracts involved in superhedging. This remark highlights the role of the bandwidth 
h. A small value of this parameter makes the width of the smoothing interval \k — h,k + h] narrow 
and diminishes, as a consequence, the distance between g^ and g^, as shown in Lemma At the 
same time the number of efficiently priced options with strike price included in such interval - i.e. 
the size of the set [k — h,k + h]f^ K^{X) - is reduced, making the super hedging payoff g’f. less 
smooth. In fact, in Panel C - with 6 equally spaced efficient strike prices ranging from 95 to 105 
- the payoff g^ is represented by a continuous, smooth line while g^ is the piecewise linear curve 
of the same color dominating it. In the case h = 2, the super hedging portfolio only contains 2 
options while for the case h = 10 it contains all 6. It is clear from the picture that g{( becomes less 
smooth as it approaches g^, suggesting the need to balance these two countervailing effects. 

To understand this point better, let M be the mesh of K{X) \ {0}, i.e. 


(22) M = sup {\k-k'\:k, k' £ K{X) \ {0}, k ^ k'} 

and the mesh of K^{X) \ {0}. Let also w^{j) denote the quantity invested in option 6{j), with 
j G K^{X), in order to super hedge g{(, obtained from Q. From the fact that (/(] and g^ coincide 
outside of the interval [k — h,k + h] by 0 and from 0 we deduce the implications 


(23a) w’({ji) > 0 ^ ji+i + h > k > ji-i - h \k - ji\ < h + 


i = 
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(23b) 10^(0) > 0 => ji > k — h 

This suggests to set h = 5M so to make the properties of the estimator sample dependent. In the 
sequel we will replace the superscript h with 5. We observe that, as a consequence of pS] ), the 
number of efficient options employed in order to superhedge 5 ^^ is at most 3 + 26M/M^ so that it 
is desirable to fix 5 > /M > 1 in order to have at least 5 options available. In the following 
sections we will experiment values of 6 ranging from 1 to 10 . 


3.2. Properties of the Estimator. A different and more classical question is whether the pro¬ 
posed estimator converges to the true price function in the presence of disturbances. The classical 
formulation of this problem is 


(24) 


q{k) = F{k) + Sk k£K{X) 


where F is the model, and is thus assumed to be a positive, decreasing, convex, function, 
whereas the errors e* are identically and independently distributed with zero expectation. 
Theorem gives an explicit functional form to our estimator: 


(25) 


Q^k) = wi{j)q{j) 

ieA'O(A) 


Observe that, by (37) and Assumption]^ 


t > 0 


(26) wi{j) > 0 for all j G K^{X) and ^ wi{j) = I 

jeKO(x) 

At first sight, then, our estimator appears as an exemplification of the local averaging approach 
which includes, as additional special cases, kernels and regressograms. For this class of estimators 
a well established theory demonstrates (see e.g. [23 p. 677]) that the MSE converges to 0 as the 
sample size diverges. However, in local average estimators weights are assumed to be uncorrelated 
with disturbances and this is of crucial importance in proving convergence. In our setting, instead, 
this property can in no way be assumed to hold. In fact the set K^{X) of efficient option prices, 
hitherto treated as given, will in general strongly depend on prices and therefore on disturbances. 
Not only but the inclusion of a given price k in K^{X) will depend not only on the corresponding 
noise but also on all other disturbances since a high value for will make it more likely for q{k) 
to be efficient. The problem discussed provides a clear exemplification of the selection bias studied 
by Heckman m and it is to some extent surprising that its pervasive role in empirical option 
pricing has not been fully recognized. From the classical work of Heckman we learn, in fact, that 
selection effects may result in biased and inconsistent estimates unless correcting for a term which 
proxies the expected value of disturbances conditional on selection. The work of Heckman has been 
extended to the non parametric setting in a recent paper by Das et al. | 7 ]- Our aim is that of 
giving sufficient conditions for the MSE to converge to 0 when taking the selection effect fully into 
account. 
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The first step is to treat K^{X) as a random variable. Observe to this end that the inclusion 
k G K^{X) holds if and only if the ^ measurable event = {q{k) < q^{k)} occurs. In order 
to take randomness into full account we replace the symbol K^{X) with Ziuj) C K{X). Then, 
if z C K{X) we have that {Z = z} = Hfcez ^ We observe that although the value of 
depends crucially on Z, it is in fact deterministic once conditional on the event {Z = z}. 
It is thus natural to follow a two step procedure (similarly to Heckman and to Das et al.) by 
first conditioning all variables on the selection mechanism and then focusing on the unconditional 
properties. Denote by Pz{-) the conditional expectation given the event {Z = z}, for z C K{X) 
and Pz{-) = YIzcK(x) When needed we explicit 2 : as {0 = Jq < jf ... < jf^} and 

write 

P= 3f^{Z=z} 

zCK(X) 

Denote by Mz and Mz the mesh of the sets z \ {0} and Z \ {0} respectively. The variable T’z(e) 
will be our correction term. 


Theorem 3. Let the function F : 


in (24) be decreasing and convex and assume that 


F G with sup 3 ,g]i|_|_ |D^T(x)| < a. Assume moreover that 

(27a) E{erea) = {), Pzisr) = Pz{£a) and Pzie^Er) = Pzi^a'^r') for all a,T,a',TeZ 


(27b) 


(27c) 


E( sup Pz{\Sr\)] <00 and E( sup Pzil^a^rl)] <00 

V r£Z ' \ rr 'raV. / 


(J.tGZ 


lim E{M%) = 0 and lim P{ti > 0) = 0 
M-s>0 ^ ' M-s>0 


Then, for any compact interval I C M+ \ {0} 

(28) lim supE((q^(t) — F(t)Y^ =0 

M^O tar \ / 


Condition (27a) is conceptually akin to the assumption made by Das et al. that the conditional 
expectation of the disturbance given selection depends only on the propensity score, see [71 As¬ 
sumptions 2.1, 2.2 and 2.3]. Condition (27b) is easily satisfied if, e.g., disturbances are uniformly 
distributed. The the most delicate property is dehnitely (|27c|) as it requires a strong form of con¬ 


vergence to 0 of the mesh of K^{X). The proof of Theorem strongly relies on some properties of 
the smoothing spline proved in Lemma a fact emphasizing the role of splines. 

3.3. The density approach. As mentioned in the introduction, splines are not the only possible 
choice for smoothing option payoff. We present in this paragraph an alternative based on some 
given probability density function, satisfying the properties 


(29) 


u(j){u)du = 0 and / \u\(l){u)du < 00 


Denote by the distribution function associated with cf and consider the following function: 


(30) 


9k{P^) = -k)-h 


''{x—k)/h 


u4>{u)du 
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Lemma 2. Let cj) be a probability density function satisfying (29). Then the function 4>) defined 
in (30) satisfies properties (i), (ii) and (Hi) of Lemma^ 


(31) 


The proof of the Lemma is rather clear given the explicit form of the derivatives of g^{x', 4>), i.e. 

X — k\ , ,/x — k\l 




in which it is implicit the inequality 


0 < g^x] </>) - Qkix) < gk{k; 4) - gli^) <hj \u\(l){u)du 


(32) 


A special case of (30) is given by the standard normal distribution: 

X — k 


g^{x-,JT) = -k) + h4j^(^ 


X — k 

h 


that will be briefly considered in the applications that follow, as a term of comparison. 

A noteworthy implication of (30), emerging clearly from (31) and is the possibility to extract 
from the non parametric prices q^{k; 4) an implicit risk neutral density in closed form, that is 


(33) 


I I 

d’^ix- 4) = Y^ biD‘^g^.{x- 4>) = '^b^ 


i=l 


i=l 

71 


X- ji '\ I 
h ) h 


where the parameters bi are the positive coefficients in ([6||j The implicit risk neutral density belongs 
thus to a preassigned family of density mixtures without actually having to make this assumption 
but rather as the consequence of the exact market pricing of a specific class of derivatives, g^{4)- 
Not only, but the mixing parameters are fixed by the market so that in order to estimate the implied 
risk neutral density one only has to specify the bandwidth h. 

In the option pricing literature a lot of interest was raised by models which assume that the 
risk neutral density is a mixture of log normals. This modeling choice has been adopted, among 
others, by Ritchey [20], Melick and Thomas m, Soderlind and Svensson |22j and Soderlind [22] . 
In [22] , Soderlind lists among the advantages of this approach the closed form for option prices and 
the flexibility of this family of densities which easily accommodates for skewness and kurtosis of 
returns while avoiding the difficulties inherent in the non parametric approach. Although this is 


clearly not the focus of our work, we mention however the relative ease of (33) in which the weights 


bi are in fact fixed by the market and do not have to be estimated, which is typically a delicate 


step involving the EM algorithm. In the next section we confront the implicit density (33) with 


the true one when it is assumed that the returns follow a mixture of log normals to provide at least 
some evidence that our approach is a viable solution for those cases in which the assumption of a 
mixture of log normals is justified. 


In fact, when / = g^ix; (j>) and X = oo, then f{X) = 1. 















NON PARAMETRIC OPTION PRICES 


13 


4. Empirical Applications: Simulation Analysis 

In this section we are going to test our methodology using as our benchmark the classical model 
of Black and Scholes, augmented for smile effects, using exactly the same values as in Ait-Sahalia 
and Duarte [I] for ease of comparison. The current underlying price is set at 1,365, maturity to 3 
months, the interest rate at 4% and we assume no dividends. We consider 25 strike prices equally 
spaced between 1,000 and 1,700, so that M = 28. Volatility will be a linear, decreasing function 
of the strike, ranging from 40% to 20%. With these values, the option prices range from 378 to 0.9. 

4.1. Deterministic Analysis. As a first step in our analysis we investigate how close are the 
quantities q^{k), a^{k) and iy^{X > k) to the true values computed from the model, where a^{k) 
is the the level of volatility implicit in q^{k) according to the Black and Scholes formula. We 
experiment three possible values of the smoothing parameter, namely 6 = 2,5, 10. The results, in 
terms of deviations of estimated values from true ones, are plotted in Figure Panel A represents 
the price gap. Panel B the volatility gap and Panel C the gap in terms of cumulative probability. 
For all choices of 6, the distance between true and fictitious values appear to be rather smalj^ 
Deviations of the smooth price from the true one are indeed quite limited but contain a smile 
effect and increase for options out of the money. This phenomenon, quite limited, soon disappears 
when we move from <5 = 10 to <5 = 5. Deviations from the benchmark are more interesting when 
considering the implicit probability as the smooth functions lie below 0 up to 1,460 for all choices 
of (5. 

[Figure 2 about here.] 

4.2. Smoothness vs. Variance. The picture changes significantly as we introduce noise into the 
original model. To avoid arbitrariness we still follow Ai't-Sahalia and Duarte [T]. In particular we 
believe that noise has an implicit microstructural component that may be well captured by liquidity 
and the bid/ask spread. We proxy illiquidity of options via the factor 1 + 5| exp(rr)A:/5o — 1| which 
is 1 for options exactly at the money and gets as high as 2.3 for deeply in and out of the money 
options. We fix the basis spread to 5% of the option price with a floor at 50 cents and a cap at 3 
dollars. Noise is then modeled as a random variable which is uniformly distributed on an interval 
centered at the origin and with radius equal to the product of illiquidity and half of the spread. 

In Figure we draw the distance between the correct Black and Scholes price and its estimate 
q^(k) when prices are affected by errors, in the way described above. 

[Figure 3 about here.] 

The picture shows that the higher the value of 6 the more smooth is the resulting estimate while, 
at the same time, goodness of fit decreases. This is a classical finding in non parametric statistics 
but presents here some new feature. We observe first that non efficient option prices, corresponding 
to plain circles, have actually no impact on estimates. The corresponding superhedging prices are 

® Of course, the Black and Scholes implicit probability, , changes accordingly to incorporate the effect of the 
strike on volatility. 
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plotted as crosses (so that the superposition of a cross to a circle signifies that the corresponding 
strike is an element of K^{X)). 8 prices out of 24 are non efficient and, as can be seen, most of 
them are concentrated on the left-hand side corresponding to deeply ITM options. This is due 
to the combined effect of illiquidity and of the spread which, in our set-up as in the real world, 
magnifies noise for this segment of the options market. We remark that noise has an asymmetric 
effect. A positive shock, particularly in the presence of a strong microstructural multiplier, will 
make the corresponding price inefficient and thus has no impact on our estimates. If the shock 
is negative or small in magnitude then it is less likely to produce price inefficiency and will be 
included in the estimated values. Our method is subject to a limited underestimation error for 
those prices which are affected by errors in a more relevant way,as is the case here for options 
deeply ITM. Second, we notice that the higher is 6 the more upward shifted will be the fitting 
curve. This follows from the payoff being larger. Including a constant term to the estimated 
CALL function would indeed reduce this problem as long as the constant make take on either sign. 
This addition would correspond however, in its financial counterpart, to the possibility of taking 
long or short positions in the riskless asset, a modeling choice which, although completely standard 
in the literature, blatantly contrasts with our starting assumptions. 

4.3. Monte Carlo Simulation. In order to construct interval estimates and evaluate the statis¬ 
tical aspects of our approach we run 5,000 Monte Carlo simulations of the error terms. Our model 
takes the form 

(34) q{k) + el, k € K{X), s = 1,2,... ,5000 

where the error terms are distributed as described above. 



dITM 

ITM 

ATM 

OTM 

dOTM 

Total 

Nr 

6 

5 

5 

4 

5 

25 

mean 

52.12 

50.8 

30.5 

18.0 

15.32 

34.74 

min 

0.0 

0.0 

0.0 

0.0 

0.0 

12.0 

max 

83.33 

80.0 

80.0 

75.0 

60.0 

56.0 


Table 1: Percentage of inefficient prices for each market segment. 

The introduction of noise produces a number of arbitrage violations, as documented in the preceding 
Table 1. These violations amount on average to 34.74% of the sample but range up to 56% and 
are above 40% in the 20% of cases. 

For any simulation s = 1,..., 5000 we compute, via Theorem the corresponding CALL func¬ 
tion, q^’^{k), implied volatility , a^'^{k), and the associated probability for different values of 
h, although we only plot 5 = 5,10. For each strike price k, we then compute the mean and the 
quantiles of the simulated sample obtaining for each quantity of interest maximum and minimum 
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values, 90% and 95% confidence intervals, mean and standard deviation. The corresponding curves 
are plotted in Figure]^ for <5 = 5, and Figurefor S = 10. 

We notice that, although in the worst possible scenarios the most deeply ITM options may be 
overpriced or underpriced by almost 5 $ (when the price fixed by the model is however more than 
300 $), the mean pricing error is never larger than 2 $ and, for reasonably liquid options just a few 
cents. Second, illiquid options are on average underpriced because of the effect highlighted before 
by which our method makes mainly use of prices affected by negative error terms. This may be 
considered as an asymmetric smile effect resulting, however, from microstructural factors. 

[Figure 4 about here.] 

[Figure 5 about here.] 

It is noteworthy that for <5 = 5 the true value always falls inside the confidence interval even at 
the 90% level for all three variables considered, suggesting that our approach produces quite reliable 
predictions. One may also notice that the two confidence bounds are not symmetric, especially for 
options deep in the money, reflecting the same distortion noted above. For the case <5 = 10 the 
situation partly changes as the estimate of the option price becomes less precise and, in particular, 
the mean price exceeds the actual one by more than 50 cents for all options ATM or OTM. In 
addition, both lower confidence bounds are breached as frequently as 21.7% suggesting that the 
choice (i = 10 produces an increase in smoothness which results however in a significant pricing 
error. We investigated also the results relative to the intermediate values of 5 and we report the 
corresponding values of the mean squared error 

5,000 

(35) MSE = — -^ ^ {q^'^{k)-q{k)f 

y ) 5,000 X 24 ^ ^ ^ ^ 

keK(X) s=l 

which we report in Table 2. 



II 

N = 6 

II 

00 

N = 10 

(5 = 1 

1.93 

1.99 

2.02 

2.03 

6 = 2 

1.78 

1.87 

1.92 

1.95 

6 = 5 

1.61 

1.63 

1.72 

1.76 

J = 10 

6.17 

2.06 

1.61 

1.58 


Table 2. Values of the MSE for alternative parameter choices. 

In the literature there is greater emphasis on the implied risk neutral density rather than on the 
implicit probability so that the focus is actually on the second derivative of the CALL function. We 
do not have a special interest for this quantity here, partly because the presumption that a density 
actually exists has no financial basis. In part, however, it is our choice to work with cubic splines 
that limits our ability to explore densities. Second derivatives in fact exist but are piecewise linear, 
making the candidate density function not particularly interesting for applications. To obtain 
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smoothness of the implied risk neutral density one should perhaps adopt a different functional 
form than cubic splines such as splines of higher order (which are however much less tractable 
computationally speaking) or as the normal option smoother described in (32)^ Another possibility 
would be to apply to the density obtained some local smoothing technique. In the Monte Carlo 
analysis performed here, however, smoothness of the density function arises upon averaging across 
all simulations. In Figure we plot the estimated mean density function together with the one 
originated from model. The value for the MISE so obtained is 4.402242e — 07 and 4.318549e — 07, 
for (5 = 5 and 5 = 10 respectively. 

[Figure 6 about here.] 

4.4. A word on normal option smoothing. We have performed the above analysis also via 


the normal options smoothing formula (32) rather than splines. The resulting values for the MSE 
as well as of confidence levels are less satisfactory than those reported above. We claim, however, 
that,in models assuming that the risk neutral density is a mixture of log-normals, then the density 


approach described in (32) produces remarkable results. To this end we have considered three 
normal densities with instantaneous parameters /r = {0.027,0.033,0.049} and a = {0.3,0.1, 0.4} 
and weights a = {0.2,0.3,0.5}, respectively. We have used the same values for strike prices, 
underlying, maturity and interest rate as above. Given these valuej^ we obtain risk neutral prices 
for options with strike prices ranging from 1,000 to 1,700. From such prices we extract the implied 
risk neutral density by using (32) after setting h = 20. We plot in Figure]^ the conditional risk- 
neutral densities over the interval 1,050 — 1, 650. We stress that indeed the mixture of log-normals 
is rather fat tailed, as desired, but also that its non parametric estimate is indeed very close to it. 
The corresponding value for the MISE amounts to 3.34864e — 06. 

[Figure 7 about here.] 


5. Empirical Applications: Market Data 

Eventually, we consider an application to market data by selecting an arbitrary trading day, 21^* 
October 2010, on the S&P 500 options markelj^ We sample ask quotes at time intervals of one 
minute each and disregard quotes for which the reported ask size is below 100. On the subsample 
so obtained we have options quotes for 180 different strike prices - ranging from 50 to 2500 - 
and 12 possible maturities - from 22”''^ October 2010 to 22”’'^ December 2012. We focus on options 
expiring in November 2010 and their quotes at 12:06, 12:39 and 13:03, around the market downturn 
of Figure [8j 

® We have performed the Monte Carlo analysis described in this subsection also for option payoffs obtained via 
( |32[ |. Nevertheless the ontput we obtained, e.g. in terms of the MSE, is less satisfactory than the results illustrated 
here for cubic splines. 

In fact with the above values, X]I=i OiM* = r. 

We make use of the quote prices provided by CBOE Market Data Retrieval (MDR). The dataset contains, 
among other things, information on bid and ask prices and volumes. Data are sampled at a frequency higher than 1 
minute. 
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[Figure 8 about here.] 

At the three selected times and for the selected maturity there are 65, 53 and 67 efficiently quoted 
strike prices respectively out of 90, 75 and 99. We therefore have a relatively long cross section 
of strikes and an incidence of inefficient prices of 30% on average. The large number of available 
strikes is one of the advantages of working with quoted ask prices, as dictated by our model, rather 
than transaction prices. In the sample there is an overwhelming ratio of contracts ITM by 10% or 
more and virtually no OTM contract as reported in Table 3. 



Total 

dITM 

ITM 

ATM 

OTM 

dOTM 

Sample 

23.972 

45.53 

27.15 

27.22 

0.09 

0 

12:06 

90 

39.77 

35.23 

25.00 

0 

0 

12:39 

75 

52.31 

32.31 

15.38 

0 

0 

13:03 

99 

38.37 

31.39 

30.23 

0 

0 


Table 3: Contracts by Moneyness, St/K. 


We select a subsample of strikes ranging from 670 to 1,255. At 12:39 the lesser number of strikes 
quoted corresponds to a larger maximum interval between consecutive strikes, i.e. M = 70, while 
at the other moments strikes do not differ by more than 25 and 30, respectively. Thus for the 
typical choice (5 = 5 we expect to have a relatively poor performance at 12 : 39, due to the high 
value of M. In the following picture we plot for each time t the curves corresponding to the three 
distinct values of d = 2, 5,10. 


[Figure 9 about here.] 

In fact we clearly see from Figure|^that the performance of our estimates at 12 : 39 is quite poor 
due to the fact that there is just one quoted strike between 1075 and 1175. The price curve ends 
up being very smooth but overestimates actual prices by as much as 5$ for the value (5 = 5 while 
in the other selected instants the price gap never exceeds 50 cents for such a parameter choice. 

Eventually we plot, for the value <5 = 5, the implied conditional risk neutral probability and 

to capture the effect of the market downturn on and 


10 


density at 12 : 06 and 13 ; 03, Figure 

. As expected, the fall in the underlying price determines a more pessimistic view embodied in 
the implied risk neutral distribution. Of course, as explained above, the density is not a smooth 
quantity, due to our choice of working with cubic splines. 


[Figure 10 about here.] 

Eventually, we can use the above information to compute some measures of risk. In particular 
we consider a long position in a future contract expiring on November 2010, i.e. in a month. The 
future price is set according to the future/spot parity, F{t,T) = St exp{r(T — t)). As a proxy for the 
riskless rate r we use the 1 month LIBOR rate on that date, quoted at 3.96%. We then compute 
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the VaR and conditional VaR at a confidence level of 2% and 5% at the two instants of time. The 
values are reported in the following Table 4: 



2.5% 

5% 

VaR 12:06 

177.58 

110.08 

VaR 13:03 

163.94 

123.95 

CVaR 12:06 

183.00 

134.30 

CVaR 13:03 

171.26 

149.65 


Table 4: Measures of risk. 

These last remarks suggest the importance to investigate the dependence of u on the current value 
of the underlying, although outside of our interests here. Another issue that would be important 
to address is the time evolution of the pricing measure. 

Mathematical Appendix 

In this appendix we present some results which we used in the proofs of the main Theorems. 

Results from section [2] 

Proof of Theorem The claim essentially follows from j6l Lemmas 7 and 8] in which, however, 
it is assumed that x < oo. The proof given here is adapted from that one to cover the present setting. 
First of all, in superhedging a given claim, we can restrict to the options which are priced efficiently, 
i.e. whose strike is included in K^{X). Let jo = 0 and write jj^i = x, V x < oo or else > jj 
arbitrarily. gg{X) > f{X) for X = 0. Assume that gg{ji) < f{ji) — £ for some i = 1,...,/ + 1. By 
continuity there exists then <5 > 0 such that g^{X) < f{X) — e holds on the set {ji — 6 < X < ji\. 
However, P{ji—5 < X < ji) > {) since otherwise esssup(XAjj) < ji—6 in contrast with Assumption 
Thus gg(X) > f{X) a.s. implies that g^{ji) > f{ji) for z = 0,...,I + 1. If A = oo then by the 
fact that jj+i was chosen arbitrarily we deduce g^{x)/x > f{x)/x, i.e. 1 > f{X). 

Viceversa, if g^{ji) > f{ji) for j = 0,...,/ + 1, then for each uj G {X < X} and some choice of 
ji+i there exists i{co) G {0,1,..., /} such that, for some 0 < a < 1, < X{lj) < ji(^uj)+i and thus 

f{X{uj)) < a/(ji(^)) + (1 - a)/(ji(^)+i) < agl{jn^^)) + (1 - a)5r^(ji(^)+i) = gl{X{uj)) 

because / is convex and g^ is linear on each interval In other words, g^{X) > f{X) 

outside of {X > X} i.e. P-a.s. so that the inequality gg{X) > f{X) P-a.s. may be written in 
vector notation as 


Da > f 

where a is the vector of weights oi,..., a/ such that 6 = Ylil=i £ ©, D is the (1 +1) x (/ +1) 

matrix whose inverse appears in (|4]) and f = [/(ji),..., f{ji)J{X)f. 
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Define the vectors w, b G implicitly by letting 

(36) bjdj = q{ji) and bjej + bi = — g(jn+i) n = 0,— 1 

i=n 

with 6/ = 1 if X < oo or else e/ = 0 and 

(37) Wi = - fUn) n = 0,..., I — 1 and y^ Widi = f{X) 

t'o ^-+1 " 

Clearly, in and b satisfy Q. The following properties are easily established by induction: (i) b > 0 
(as jo, ... ,ji G K^{X)), (ii) w > 0 (as / G T), (in) b^D = and (in) w = D^^f. But then, 

= mill q^a = min b^Da > b^f = q^w 

{aeR(+bDa>f} {aeR(+bDa>f} 


□ 


Results from section [3] 


Proof of Lemma By a result of Turlach 
cubic spline of the form 


p. 85] the program (16) admits as its solution a 


N 

g'iix) = Y, 1 [k+tlk+diY ^ [fc+i?,oo) 

2=1 


X E 


where ti,, tjj+i as in the text and ^ is a polynomial of degree 3 for i = 1,..., It is clear 
from the constraints imposed to (16) that indeed G T. Moreover, these same constraints imply 
that g^{x) = g^ix) when x ^ [k — h, k + h] while 1 > Dg^{x) > 0 on \k — h,k + h]. Thus, g^ — g^ is 
increasing on (—oo, k] and decreasing afterwards, so that su];>^{g^ — g^){x) < {g^ — g^){k) = g^{k). 
Moreover, the (non empty) set of solutions is clearly convex and the functional Ik{h; f) is strictly 
convex in / so that the solution is necessarily unique. Define 

9 kix) = - x) + (x - k) x>0 

and observe that g^ G x(/c; h). Moreover, one deduces from (15) that 


AT+l 


h{h; g^) — Yj + (O.l/i) 


2=1 


X] + i?)] + X/ [9fc(fc + ^f) “ +{0.1h) 

t?<o t?>o 


Y^ +'^i) ~'^i] + [5fc(^ + ^i^)] + (o.l/l) 


pk-\-h 

' {D^9kix))^dx 
k—h 

/'k.-^h 

{D‘^9k{x)f dx 
{D'^9k{x)fdx 


• k—h 
f^k+h 


t ?>0 


C'*<0 


' k—h 


= h{h;gl) 


However, since the solution is unique, we have the symmetry relation 
(38) gk{x) = gk{2k - x) + {x - k) x>0 
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Let / G x(^; h), h > h' and define T : x(fc; h) —)> x{k; h') implicitly by letting 

Tf{x) = f (^k + {x-k)^'^ ^ 

Observe that T is one to one and onto and that D‘^Tf{x) = h/h'D‘^f{k + {x — k)h/h'). Thus, 


N+l 


2 = 1 


r-fc+Zl' 


4(h';r/) = U{k + t’l')-Tf{k + t’l') +i0.1h'f / {D^Tf{x)ydx 


k—h' 


7V+1 


i=i ^ ^ dk-h 

= {hyhfikih-,f) 

Thus / solves the program ( |16| ) relatively to h if and only if T/ solves it relatively to h'. By 
uniqueness we conclude that (20) holds. Let 0 < h' < h and observe that, if x < A; 

9k (y = 9kik + {x- k)h/h')h'/h < gl{x)h'/h < g^{x) 

a conclusion which extends to x > /c by ( |38[ ). This proves (ii). Given that 0 < Dg^(x) < 1 we 
conclude that 

0 < fffc'(x) - gl{x) < g^{k) = ^g^ik) 

so that g^ decreases to g^ uniformly in x. 

If y = k' — k £ M., then writing = oo and 
Af+l 

9kix) = + + y - (k' + ty) = Xk{x + y,k') 

2=1 

It is obvious that 7 ^(x+y; k’) = 7 ^(x; k) and that g^_^_y{x+y) = ^^(x). But then, for z = 1,..., A^+1, 
9t'{k' + — lk{k' + ; A:') = g^{k + — 7fc(^ + k) = g^{k + tf) — g^{k + y) 

and that 

pk+h i-k'+h 

/ [-C'^ 7 fc(a:; A:)]^fix = / [D‘^-f^{x + y-k')]^dx 

Jk-h Jk'-h 

Using the notation of ( [I^ , we conclude that 

Ik{h) = Ik{h;Xki-',k)) = 4+y(/i;7fc(-;A:')) > Ik'{h) A: > 0,y G M 

The same inequality holds after exchanging k for k' so that 7fc(-; ^0 9k' both splines of 

degree 3 solving (16) and thus coincide, by uniqueness. We conclude that g^_^_y{x) = g^{x — y) and 
thus, if 0 < a < 1 and k < aki + (1 — a)A; 2 , 

a5'fci(a^) + (1 - a)9k2iy = (^9k((^ + (ki - k)) + (1 - a)gj^(x + k 2 - k) 

> (x + aki + (1 - a)k 2 - k)) 

> 9k{x) 


proving (z). 


□ 
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Proof of Theorem Write 

(39) q^{k) = F{k) + ^ wi{T)[F{T) - F{k)\ + ^ wi{T)er = F{k) + Ai{Z) + ^ 

T^Z T^Z T^Z 


with Al{Z) = Y^rezwiiT)[F{T) - F{k)]. Then, by ( [27a| 

Pz[{qHk) - F{k)f) = Y, wi{a)wi{T)Pz[{F{a)-F{k) + e^){F{T)-F{k)+er)) 


(7 ,t£Z 

AiiZf + [PzieD - Pzis^Sr)] Y + PzieaSr) + 2Pz{er)Ai{Z) 

T&Z 


By (23a) we know that wf.{T) = 0 when \k — t\ > (1 + 5)Mz and r > 0. Using Taylor expansion 
with Lagrange remainder we get 


\Ai{Z)\<wimF{Q)-F{k)]+ Y wi{T)[DF{k)\T-k\ + 

rez\{0} 

-.2 




2 

< + \DF{k)\{5 + l)Mz + D^F{k){6 + 1)2m| + |(1 + 


D^F{k) + 


3! 

5 ]\ ^3 


D^F{Xr) 


where we made use of (23b). Under (27c) this implies 


\un^E[Ai{Zf + 2Pz{er)Ai{Z)) = hm L;((4(Z)2 + 2Pz(e04(^))l{n<fc-5M,}) = 0 

If i > 1 we have wf,{Ti) = — Di{gf^) (with rj+i = r/ + 1, conventionally) so that 

wiiTi) = Di{gl) - A-i(5fc) 

< Daiiri+i) - Dgiin-i) 

{k-Ti-if 3 


(convexity) 


= D‘^9k{k){Ti+i - n-i) + 


(Ti+i -kf . 




< 2D^gi{k)Mz + a[l + (1 + 6f]Ml 
and thus 

(40) Y^ — l{ri>fc-5Mz} + ‘2.D‘^gl{k)Mz + a[l + (1 + (5)^]M| 

T&Z 

The claim follows from 

^ra^E{{q^{k) - F{k)f'^ = ^ui^E([Pz{el) - Pz{eaSr)] Y^ii^f + Pzie^jer)) 

< lim £'(2supP2(eo-er) Ti<k- 6 MzU + 

IVl —¥ U N /T "T Z 


tGZ 


= 0 


□ 
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Figures 


Panel A: Option Payoff g^(X) 



Panel B: Option Payoff Derivative Dg^{X) 



Panel C: Smooth Superhedging 



Figure 1 . Smoothing Option Payoff by h. 

Plot of 5 ^( 3 :) (Panel A) and Dg^(x) (Panel B) for k — 100 and h — 2,5,10. In Panel C we draw g^~^^(x) (red 
dotted line) together with g^^^^ {X) (piecewise linear curves), the payoff of the portfolio superhedging it 
assuming to have 3, 5 or 9 equally spaced strike prices. 




















-0.002 -0.001 0.000 0.001 0.002 0.003 0.004 0.000 0.001 0.002 0.003 0.004 0.005 0.006 


Figures 


25 


Panel A: Option Price (k) 
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Panel B: Implied Volatility 
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Panel C: Probability {X > k) 
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Figure 2. Deviations from Black and Scholes with a Smile. 

cr = 0.4 - 0.2(fc - 1000)/700, /OK - /da da /dk and N = 10. 
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Figures 


5=1 - 5=2 . 5 = 5 - 5=10 



e 
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Figure 3. Smoothness vs. Variance. 

Estimating the model q{k) + Sk with = 10 and (5 = 1,2, 5, 10. Values are expressed in terms of their distance 
from the correct B&S price. Actual option prices are plotted as circles while the corresponding efficient prices are 
plotted as crosses. The two symbols are overwritten whenever the corresponding strike belongs to K^{X). 









0.20 0.25 0.30 0.35 0.40 0.45 


Figures 


27 


Panel A: Option Price 



Panel B: Volatility 



Panel C: Probability 



Figure 4. Simulated Simulated Confidence Intervals: = 10 and 6 = 5 

The confidence bands and mean were obtained after 5, 000 Monte Carlo simulations. 
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Figures 


Panel A: Option Price 
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Panel B: Volatility 



Panel C: Probability 



Figure 5. Simulated Confidence Intervals: iV = 10 and 5 = 10 

The confidence bands and mean were obtained after 5, 000 Monte Carlo simulations. 
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Panel A: <5 = 5 



Panel B: (5 = 10 



Figure 6. Implied risk-neutral average density: = 10 and (5 = 5,10 
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Figures 


Risk neutral density 



Figure 7. Risk-neutral densities for a mixture of log-normals. 

All curves represent a corresponding conditional density over the interval 1050 — 1650. The dashed lines are the 
starting log-normal densities and the black solid line their mixture. The red line is the density extracted from the 

option prices. MISE — 3.34864e — 06. 
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Figures 


Figure 9 


Panel A: Option Prices at 12:06 
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Panel B: Option Prices at 12:39 
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Panel C: Option Prices at 13:03 



Actual and estimated option prices as differences with the efficient 
prices. Market prices are represented as circles. 
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Panel A: Implied risk neutral probability 



Panel B: Implied risk neutral density 



Figure 10. Conditional risk neutral distribution, > x), and density, d^{x), 
at 12:06 and 13:03 for the case <5 = 5. 






